Compositional Preferences in Quadruplets of Nearest Neighbor Residues in Protein Structures: Statistical Geometry Analysis
نویسندگان
چکیده
Three-dimensional structure and amino acid sequence of proteins are related by an unknown set of rules that is often referred to as the folding code. This code is believed to be significantly influenced by nonlocal interactions between the residues. A quantitative description of nonlocal contacts requires the identification of neighboring residues. We applied statistical geometry approach to analyze the patterns of spatial proximity of residues in known protein structures. Structures from a dataset of well resolved nonhomologous proteins with a single point representation of residues by Cα atoms were tessellated using Delaunay algorithm. The Delaunay tessellation generates an aggregate of space-filling irregular tetrahedra, or Delaunay simplices. The vertices of each simplex objectively define four nearest neighbor Cα atoms and therefore four nearest neighbor residues. Compositional analysis of Delaunay simplices reveals highly nonrandom clustering of amino acid residues in protein structures. Relative abundance or deficiency of residue quadruplets with certain compositions reflects propensities of different types of amino acids to be associated or disassociated in folded proteins. The likelihood of occurrence of four residues in one simplex displays strong nonrandom signal also in the case of a reduced amino acid alphabet. We used several different three-letter alphabets based on the residue chemical and structural properties and on the complementarity of the corresponding codons. In all cases the clustering of residues correlates with their properties or genetic origin. The results of this analysis are being implemented in algorithms for protein structure classification and prediction.
منابع مشابه
Statistical geometry analysis of proteins: implications for inverted structure prediction.
The topology of folded proteins from the representative dataset of well-defined three-dimensional protein structures is studied using a statistical geometry approach. Amino acid residues in protein chains are represented by C alpha atoms, thus reducing the protein three-dimensional structure to a set of points in three dimensional space. The Delaunay tessellation of a protein structure generate...
متن کاملDelaunay Tessellation of Proteins: Four Body Nearest-Neighbor Propensities of Amino Acid Residues
Delaunay tessellation is applied for the first time in the analysis of protein structure. By representing amino acid residues in protein chains by C alpha atoms, the protein is described as a set of points in three-dimensional space. Delaunay tessellation of a protein structure generates an aggregate of space-filling irregular tetrahedra, or Delaunay simplices. The vertices of each simplex defi...
متن کاملDevelopment of a four-body statistical pseudo-potential to discriminate native from non-native protein conformations
MOTIVATION Most scoring functions used in protein fold recognition employ two-body (pseudo) potential energies. The use of higher-order terms may improve the performance of current algorithms. METHODS Proteins are represented by the side chain centroids of amino acids. Delaunay tessellation of this representation defines all sets of nearest neighbor quadruplets of amino acids. Four-body conta...
متن کاملNon-zero probability of nearest neighbor searching
Nearest Neighbor (NN) searching is a challenging problem in data management and has been widely studied in data mining, pattern recognition and computational geometry. The goal of NN searching is efficiently reporting the nearest data to a given object as a query. In most of the studies both the data and query are assumed to be precise, however, due to the real applications of NN searching, suc...
متن کاملEFFECT OF THE NEXT-NEAREST NEIGHBOR INTERACTION ON THE ORDER-DISORDER PHASE TRANSITION
In this work, one and two-dimensional lattices are studied theoretically by a statistical mechanical approach. The nearest and next-nearest neighbor interactions are both taken into account, and the approximate thermodynamic properties of the lattices are calculated. The results of our calculations show that: (1) even though the next-nearest neighbor interaction may have an insignificant ef...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998